Abstract. An important task in the analysis of multiagent systems is to understand how groups of selfish players
can form coalitions, i.e., work together in teams. In this paper, we study the dynamics of coalition formation under 
bounded rationality. We consider settings where each team's profit is given by a convex function, and propose 
three profit-sharing schemes, each of which is based on the concept of marginal utility. The agents are assumed 
to be myopic, i.e., they keep changing teams as long as they can increase their payoff by doing so. We study the 
properties (such as closeness to Nash equilibrium or total profit) of the states that result after a polynomial number 
of such moves, and prove bounds on the price of anarchy and the price of stability of the corresponding games. 

1 Introduction 

Cooperation and collaborative task execution are fundamentally important both for human societies
and for multiagent systems. Indeed, it is often the case that certain tasks are too complicated
or resource-consuming to be executed by a single agent, and a collective effort is needed. Such 
settings are usually modeled using the framework of cooperative games, which specify the amount 
of payoff that each subset of agents can achieve: when the game is played the agents split into 
teams (coalitions), and the payoff of each team is divided among its members. 

The standard framework of cooperative game theory is static, i.e., it does not explain how the
players arrive at a particular set of teams and a payoff distribution. However, understanding the
dynamics of coalition formation is an obviously important issue from the practical perspective, and
there is an active stream of research that studies bargaining and coalition formation in cooperative
games (see, e.g., [CDS93, MW95, O96, Y03]). Most of this research assumes that the agents are fully
rational, i.e., can predict the consequences of their actions and maximize their (expected) utility
based on these predictions. However, full rationality is a strong assumption that is unlikely to hold
in many real-life scenarios: first, the agents may not have the computational resources to infer their
optimal strategies, and second, they may not be sophisticated enough to do so, or lack information
about other players. Such agents may simply respond to their current environment without
worrying about the subsequent reaction of other agents; such behavior is said to be myopic. Now,
coalition formation by computationally limited agents has been studied by a number of researchers
in multi-agent systems, starting with the work of [SK99] and [SL97]. However, myopic behavior
in coalition formation has received relatively little attention in the literature (for some exceptions, see
[DS02, CB04, AS09]). In contrast, myopic dynamics of non-cooperative games is the subject of a
growing body of research (see, e.g., [FPT04, AAE+08, FFM08]).

In this paper, we merge these streams of research and apply techniques developed in the context 
of analyzing the dynamics of non-cooperative games to coalition formation settings. In doing so, 
we depart from the standard model of games with transferable utility, which allows the players
in a team to share the payoff arbitrarily: indeed, such flexibility will necessitate a complicated
negotiation process whenever a player wants to switch teams. Instead, we consider three payoff 
models that are based on the concept of marginal utility, i.e., the contribution that the player makes 
to his current team. Each of the payoff schemes, when combined with a cooperative game, induces 
a non-cooperative game, whose dynamics can then be studied using the rich set of tools developed 
for such games in recent years. 

We will now describe our payment schemes in more detail. We assume that we are given a 
convex cooperative game, i.e., the values of the teams are given by a submodular function; the 
submodularity property means that a player is more useful when he joins a smaller team, and plays 
an important role in our analysis. In our first scheme, the payment to each agent is given by his 
marginal utility for his current team; by submodularity, the total payment to the team members 
never exceeds the team's value. This payment scheme rewards each agent according to the value 
he creates; we will therefore call these games Fair Value games. Our second scheme takes into 
account the history of the interaction: we keep track of the order in which the players have joined 
their teams, and pay each agent his marginal contribution to the coalition formed by the players 
who joined his current team before him. This ensures that the entire payoff of each team is fully 
distributed among its members. Moreover, due to the submodularity property a player's payoff 
never goes down as long as he stays with the same team. This payoff scheme is somewhat reminiscent
of the reward schemes employed in industries with strong labor unions; we will therefore
refer to these games as Labor Union games. Our third scheme can be viewed as a hybrid of the 
first two: it distributes the team's payoff according to the players' Shapley values, i.e., it pays each 
player his expected marginal contribution to a coalition formed by his predecessors when players
are reordered randomly; the resulting games are called Shapley games. 

Our contributions. We study the equilibria and dynamics of the three games described above.
We are interested in the properties of the states that can be reached by natural dynamics in a 
polynomial number of steps: in particular, whether such states are (close to) Nash equilibria, and 
whether they result in high total productivity, i.e., the sum of the teams' values (note that in Fair 
Value games the latter quantity may differ from the social welfare, i.e., the sum of players' payoffs). 

We first show that all our games are potential games, and hence admit a Nash equilibrium in
pure strategies. We then argue that for each of our games the price of anarchy is bounded by 2. For
the first two classes of games, we can also bound their $\alpha$-price of anarchy, i.e., the ratio between the
total profit of the optimal coalition structure and that of the worst $\alpha$-Nash equilibrium, by $2 + \alpha$. We
also provide bounds on the price of stability for all three games. Further, for the first two classes of
games, we show that the basic Nash dynamic converges in polynomial time to an approximately
optimal state, where the approximation ratio is arbitrarily close to the price of anarchy; these results
extend to the basic $\alpha$-Nash dynamic and the $\alpha$-price of anarchy. To obtain these results, we observe that
both the Fair Value games and the Labor Union games can be viewed as variants of $\beta$-nice games
introduced in [AAE+08], and prove general convergence results for such games, which may be of
independent interest. We then show that Labor Union games have additional desirable properties:
in such games the $\alpha$-Nash dynamic quickly converges to an $\alpha$-Nash equilibrium; also, if we start with
the state where each player is unaffiliated, the Nash dynamic converges to a Nash equilibrium
after each player gets a chance to move.
The rest of the paper is organized as follows. After a brief overview of the related work, we
provide the required preliminaries in Section 2. Section 3 deals with $\beta$-nice games and lays the
groundwork that will be necessary for the technical results in the next section. Then, in Section 4,
we describe our three classes of games and present our results for these games. Section 5 explains
the relationship between our games and the well-studied cut games. Section 6 presents our conclusions
and directions for future work.

Related Work. The games studied in this paper belong to the class of potential games,
introduced by Monderer and Shapley [MS96]. In potential games, any sequence of improvements
by players converges to a pure Nash equilibrium. However, the number of
steps can be exponential in the description of the game. The complexity of computing
(approximate) Nash equilibrium in various subclasses of potential games such as congestion
games [Ros73], cut games [SY91] or party affiliation games [FPT04] has received a
lot of attention in recent years [JPY88, FPT04, CMS06, SV08, Tsc10, BCK10]. A related issue
is how long it takes for some form of best response dynamics to reach an equilibrium
[MV04, GMV05, CS07, ARV08, SV08, AAE+08]. Even if a Nash equilibrium cannot be
reached quickly, a state reached after a polynomial number of steps may still have high social
welfare; this question is studied, for example, in [CMS06, FFM08, FM09].

A recent paper by Gairing and Savani [GS10] studies the dynamics of a class of cooperative
games known as additively separable hedonic games; their focus is on the complexity of computing
stable outcomes. While the class of all convex cooperative games considered in this paper is
considerably broader than that of additively separable games, [GS10] also studies notions of
stability not considered here.

2 Preliminaries 

Non-cooperative games. A non-cooperative game is defined by a tuple
$\mathcal{G} = (N, (\Sigma_i)_{i \in N}, (u_i)_{i \in N})$, where $N = \{1, 2, \dots, n\}$ is the set of players, $\Sigma_i$ is the set of (pure)
strategies of player $i$, and $u_i : \times_{i \in N} \Sigma_i \to \mathbb{R}^+ \cup \{0\}$ is the payoff function of player $i$.

Let $\Sigma = \times_{i \in N} \Sigma_i$ be the strategy profile set or state set of the game, and let
$S = (s_1, s_2, \dots, s_n) \in \Sigma$ be a generic state in which each player $i$ chooses strategy $s_i \in \Sigma_i$. Given
a strategy profile $S = (s_1, s_2, \dots, s_n)$ and a strategy $s_i' \in \Sigma_i$, let $(S_{-i}, s_i')$ be the strategy
profile obtained from $S$ by changing the strategy of player $i$ from $s_i$ to $s_i'$, i.e.,
$(S_{-i}, s_i') = (s_1, s_2, \dots, s_{i-1}, s_i', s_{i+1}, \dots, s_n)$.

Nash equilibria and dynamics. Given a strategy profile $S = (s_1, s_2, \dots, s_n)$, a strategy $s_i' \in \Sigma_i$
is an improvement move for player $i$ if $u_i(S_{-i}, s_i') > u_i(S)$; further, $s_i'$ is called an $\alpha$-improvement
move for $i$ if $u_i(S_{-i}, s_i') > (1 + \alpha) u_i(S)$, where $\alpha > 0$. A strategy $s_i^b \in \Sigma_i$ is a best response
for player $i$ in state $S$ if it yields the maximum possible payoff given the strategy choices of the
other players, i.e., $u_i(S_{-i}, s_i^b) \ge u_i(S_{-i}, s_i')$ for any $s_i' \in \Sigma_i$. An $\alpha$-best response move is both an
$\alpha$-improvement and a best response move.

A (pure) Nash equilibrium is a strategy profile in which every player plays her best response.
Formally, $S = (s_1, s_2, \dots, s_n)$ is a Nash equilibrium if for all $i \in N$ and for any strategy $s_i' \in \Sigma_i$
we have $u_i(S) \ge u_i(S_{-i}, s_i')$. We denote the set of all (pure) Nash equilibria of a game $\mathcal{G}$ by
$\mathcal{NE}(\mathcal{G})$. A profile $S = (s_1, \dots, s_n)$ is called an $\alpha$-Nash equilibrium if no player can improve his
payoff by more than a factor of $(1 + \alpha)$ by deviating, i.e., $(1 + \alpha) u_i(S) \ge u_i(S_{-i}, s_i')$ for any $i \in N$
and any $s_i' \in \Sigma_i$. The set of all $\alpha$-Nash equilibria of $\mathcal{G}$ is denoted by $\mathcal{NE}^{\alpha}(\mathcal{G})$. In a strong Nash
equilibrium, no group of players can improve their payoffs by deviating, i.e., $S = (s_1, \dots, s_n)$ is
a strong Nash equilibrium if for all $I \subseteq N$ and any strategy vector $S' = (s_1', \dots, s_n')$ such that
$s_i' = s_i$ for $i \in N \setminus I$, if $u_i(S') > u_i(S)$ for some $i \in I$, then $u_j(S') < u_j(S)$ for some $j \in I$.

Let $\Delta_i(S)$ be the improvement in the player's payoff if he performs his best response, i.e.,
$\Delta_i(S) = u_i(S_{-i}, s_i^b) - u_i(S)$, where $s_i^b$ is the best response of player $i$ in state $S$. For any $Z \subseteq N$ let
$\Delta_Z(S) = \sum_{i \in Z} \Delta_i(S)$, and let $\Delta(S) = \Delta_N(S)$. A Nash dynamic (respectively, $\alpha$-Nash dynamic)
is any sequence of best response (respectively, $\alpha$-best response) moves. A basic Nash dynamic
(respectively, basic $\alpha$-Nash dynamic) is any Nash dynamic (respectively, $\alpha$-Nash dynamic) such
that at each state $S$ the player $i$ that makes a move has the maximum absolute improvement, i.e.,
$i \in \arg\max_{j \in N} \Delta_j(S)$.
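The definitions of best response, $\Delta_i(S)$ and the basic Nash dynamic can be sketched in a few lines of Python. This is a minimal illustration and not the authors' implementation: players are indexed from 0, a state is a tuple of strategies, and the payoff function `u(i, state)`, the helper names, and the toy two-player coordination game are all assumptions introduced here.

```python
def deviate(state, i, s):
    """Return (S_{-i}, s): the state with player i's strategy replaced by s."""
    return state[:i] + (s,) + state[i + 1:]

def best_response(u, strategies, i, state):
    """Return a best-response strategy of player i and its improvement Delta_i(S)."""
    best = max(strategies[i], key=lambda s: u(i, deviate(state, i, s)))
    return best, u(i, deviate(state, i, best)) - u(i, state)

def basic_nash_dynamic(u, strategies, state, max_steps=1000):
    """Repeatedly let a player with maximum Delta_i(S) play a best response."""
    for _ in range(max_steps):
        moves = [(best_response(u, strategies, i, state), i) for i in range(len(state))]
        (s, gain), i = max(moves, key=lambda m: m[0][1])
        if gain <= 0:  # nobody can improve: the state is a Nash equilibrium
            return state
        state = deviate(state, i, s)
    return state

# Hypothetical toy game: both players get 1 if they both pick 0,
# 2 if they both pick 1, and 0 if they disagree.
def u(i, state):
    if state[0] != state[1]:
        return 0
    return 1 if state[0] == 0 else 2

final = basic_nash_dynamic(u, [(0, 1), (0, 1)], (0, 1))
```

Starting from the mismatched state `(0, 1)`, player 0 has the larger improvement and moves first, after which no player can gain by deviating.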

Price of anarchy. Given a game $\mathcal{G}$ with a set of states $\Sigma$, and a function $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$,
we write $\mathrm{OPT}_f(\mathcal{G}) = \max_{S \in \Sigma} f(S)$. The price of anarchy $\mathrm{PoA}_f(\mathcal{G})$ and the price of
stability $\mathrm{PoS}_f(\mathcal{G})$ of a game $\mathcal{G}$ with respect to a function $f$ are, respectively, the worst-case
ratio and the best-case ratio between the value of $f$ in a Nash equilibrium and $\mathrm{OPT}_f(\mathcal{G})$, i.e.,

$$\mathrm{PoA}_f(\mathcal{G}) = \max_{S \in \mathcal{NE}(\mathcal{G})} \frac{\mathrm{OPT}_f(\mathcal{G})}{f(S)}, \qquad \mathrm{PoS}_f(\mathcal{G}) = \min_{S \in \mathcal{NE}(\mathcal{G})} \frac{\mathrm{OPT}_f(\mathcal{G})}{f(S)}.$$

The strong price of anarchy
and the strong price of stability are defined similarly; the only difference is that the maximum
(respectively, minimum) is taken over all strong Nash equilibria. Further, the $\alpha$-price of anarchy
$\mathrm{PoA}_f^{\alpha}(\mathcal{G})$ of a game $\mathcal{G}$ with respect to $f$ is defined as
$\mathrm{PoA}_f^{\alpha}(\mathcal{G}) = \max_{S \in \mathcal{NE}^{\alpha}(\mathcal{G})} \frac{\mathrm{OPT}_f(\mathcal{G})}{f(S)}$; the $\alpha$-price
of stability $\mathrm{PoS}_f^{\alpha}(\mathcal{G})$ can be defined similarly. Originally, these notions were defined with respect
to the social welfare function, i.e., $f(S) = \sum_{i \in N} u_i(S)$. However, we give a more general definition
since in the setting of this paper it is natural to use a different function $f$. We omit the index $f$
when the function $f$ is clear from the context.
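For very small games, both ratios can be computed by exhaustive enumeration. The sketch below is illustrative only: the function names and the toy coordination game are assumptions made here, $f$ is taken to be the social welfare, and the code assumes that every Nash equilibrium has $f(S) > 0$ so that the ratios are well defined.

```python
from itertools import product

def nash_equilibria(u, strategies):
    """Enumerate all pure Nash equilibria of a finite game by brute force."""
    equilibria = []
    for state in product(*strategies):
        stable = all(
            u(i, state) >= u(i, state[:i] + (s,) + state[i + 1:])
            for i in range(len(state))
            for s in strategies[i]
        )
        if stable:
            equilibria.append(state)
    return equilibria

def poa_and_pos(u, strategies, f):
    """Return (PoA_f, PoS_f); assumes f > 0 at every Nash equilibrium."""
    opt = max(f(state) for state in product(*strategies))
    values = [f(state) for state in nash_equilibria(u, strategies)]
    return opt / min(values), opt / max(values)

# Hypothetical toy game: payoff 1 each at (0, 0), 2 each at (1, 1), 0 otherwise.
def u(i, state):
    if state[0] != state[1]:
        return 0
    return 1 if state[0] == 0 else 2

def welfare(state):
    return u(0, state) + u(1, state)

strategies = [(0, 1), (0, 1)]
poa, pos = poa_and_pos(u, strategies, welfare)
```

In this toy game the two equilibria are `(0, 0)` and `(1, 1)`, with welfare 2 and 4 respectively, so the worst equilibrium loses a factor of 2 and the best one is optimal.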

Potential games. A non-cooperative game $\mathcal{G}$ is called a potential game if there is a function
$\Phi : \Sigma \to \mathbb{N}$ such that for any state $S$ and any improvement move $s_i'$ of a player $i$ in $S$ we have
$\Phi(S_{-i}, s_i') - \Phi(S) > 0$; the function $\Phi$ is called the potential function of $\mathcal{G}$. The game $\mathcal{G}$ is called
an exact potential game if we have $\Phi(S_{-i}, s_i') - \Phi(S) = u_i(S_{-i}, s_i') - u_i(S)$. It is known that any
potential game has a pure Nash equilibrium [MS96, Ros73].

Cooperative games. A cooperative game $G = (N, v)$ is given by a set of players $N$ and a
characteristic function $v : 2^N \to \mathbb{R}^+ \cup \{0\}$ that for each set $I \subseteq N$ specifies the profit that
the players in $I$ can earn by working together. We assume that $v(\emptyset) = 0$. A coalition structure
over $N$ is a partition of players in $N$, i.e., a collection of sets $I_1, \dots, I_k$ such that (i) $I_i \subseteq N$ for
$i = 1, \dots, k$; (ii) $I_i \cap I_j = \emptyset$ for all $i < j \le k$; and (iii) $\bigcup_{j=1}^{k} I_j = N$. A game $G = (N, v)$
is called monotone if $v$ is non-decreasing, i.e., $v(I) \le v(J)$ for any $I \subseteq J \subseteq N$. Further, $G$
is called convex if $v$ is submodular, i.e., for any $I \subseteq J \subseteq N$ and any $i \in N \setminus J$ we have
$v(I \cup \{i\}) - v(I) \ge v(J \cup \{i\}) - v(J)$. Informally, in a convex game a player is more useful when
he joins a smaller coalition. We will make use of the following property of submodular functions.
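On small instances, monotonicity and submodularity can be verified directly from the definitions. The following sketch is an illustration under stated assumptions: the `SKILLS` table and the `coverage` function are hypothetical (they do not appear in the paper) and are used because coverage functions are a standard example of monotone submodular functions; the checks take exponential time and are meant for toy inputs only.

```python
from itertools import combinations

def subsets(items):
    """Yield every subset of items as a frozenset."""
    items = list(items)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def is_monotone(v, players):
    """Check v(I) <= v(J) for all I subset of J subset of players."""
    return all(v(I) <= v(J) for J in subsets(players) for I in subsets(J))

def is_submodular(v, players):
    """Check v(I + i) - v(I) >= v(J + i) - v(J) for all I subset of J, i outside J."""
    players = frozenset(players)
    for J in subsets(players):
        for I in subsets(J):
            for i in players - J:
                if v(I | {i}) - v(I) < v(J | {i}) - v(J):
                    return False
    return True

# Hypothetical example: each player covers a set of skills,
# and a team's value is the number of distinct skills it covers.
SKILLS = {1: {"a", "b"}, 2: {"b", "c"}, 3: {"c"}}

def coverage(team):
    covered = set()
    for i in team:
        covered |= SKILLS[i]
    return len(covered)
```

For contrast, a function such as `lambda team: len(team) ** 2` has increasing marginal returns and fails the submodularity check.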






Lemma 1. Let $f : 2^V \to \mathbb{R}$ be a submodular function. Then for any pair of sets $X, Y \subseteq V$
such that $X \cap Y = \emptyset$ and $X = \{x_1, x_2, \dots, x_k\}$, it holds that
$\sum_{j=1}^{k} \left( f(Y \cup \{x_j\}) - f(Y) \right) \ge f(Y \cup X) - f(Y)$.

Proof. Since $f$ is a submodular function, for every $x_j \in X$ we have

$$f(Y \cup \{x_j\}) - f(Y) \ge f(Y \cup \{x_1, x_2, \dots, x_{j-1}, x_j\}) - f(Y \cup \{x_1, x_2, \dots, x_{j-1}\}).$$

The lemma now follows by summing these inequalities for all $j = 1, \dots, k$. □
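To make the summation step explicit: the right-hand sides telescope. In the derivation below, the shorthand $X_j = \{x_1, \dots, x_j\}$ with $X_0 = \emptyset$ is introduced here for convenience and is not notation used elsewhere in the paper.

```latex
\begin{align*}
\sum_{j=1}^{k} \bigl( f(Y \cup \{x_j\}) - f(Y) \bigr)
  &\ge \sum_{j=1}^{k} \bigl( f(Y \cup X_j) - f(Y \cup X_{j-1}) \bigr) \\
  &= f(Y \cup X_k) - f(Y \cup X_0) \\
  &= f(Y \cup X) - f(Y).
\end{align*}
```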

3 Perfect $\beta$-nice Games

In this section, we define the class of perfect $\beta$-nice games (our definition is inspired by [AAE+08],
but differs from the one given there), and prove a number of results for such games. Subsequently,
we will show that many of the profit-sharing games considered in the paper belong to this class.
Most proofs in this section are relegated to Appendix A.

Definition 1. A potential game $\mathcal{G}$ with a potential function $\Phi$ is called perfect with respect to a
function $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$ if for any state $S$ it holds that $f(S) \ge \sum_{i \in N} u_i(S)$, and, moreover,
for any improvement move $s_i'$ of player $i$ we have

$$f(S_{-i}, s_i') - f(S) \ge \Phi(S_{-i}, s_i') - \Phi(S) \ge u_i(S_{-i}, s_i') - u_i(S).$$

Also, a game $\mathcal{G}$ is called $\beta$-nice with respect to $f$ if for every state $S$ we have
$\beta \cdot f(S) + \Delta(S) \ge \mathrm{OPT}_f(\mathcal{G})$.

We can bound the price of anarchy of a $\beta$-nice game by $\beta$.

Lemma 2. For any $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$ and any game $\mathcal{G}$ that is $\beta$-nice w.r.t. $f$ we have
$\mathrm{PoA}_f(\mathcal{G}) \le \beta$.

Proof. The lemma follows by observing that for any Nash equilibrium $S$ we have $\Delta(S) \le 0$. □

Lemma 2 can be extended to the $\alpha$-price of anarchy for any $\alpha > 0$.

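Lemma 3. For any $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$, any $\alpha > 0$, and any game $\mathcal{G}$ that is $\beta$-nice w.r.t. $f$ we have
$\mathrm{PoA}_f^{\alpha}(\mathcal{G}) \le \alpha + \beta$.

Proof. For any $\alpha$-Nash equilibrium $S$ we have $\Delta(S) \le \alpha \sum_{i \in N} u_i(S) \le \alpha f(S)$. □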
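The one-line proof can be unpacked as follows. This is a sketch under two assumptions made explicit here: the step $\sum_{i \in N} u_i(S) \le f(S)$ is the first condition of Definition 1 (so the game is assumed to be perfect with respect to $f$), and $f(S) > 0$ so that the final division is valid.

```latex
\begin{align*}
\Delta_i(S) &= u_i(S_{-i}, s_i^b) - u_i(S)
  \le (1 + \alpha)\, u_i(S) - u_i(S) = \alpha\, u_i(S)
  && \text{($S$ is an $\alpha$-Nash equilibrium)} \\
\mathrm{OPT}_f(\mathcal{G}) &\le \beta f(S) + \Delta(S)
  \le \beta f(S) + \alpha \sum_{i \in N} u_i(S)
  \le (\alpha + \beta)\, f(S)
  && \text{($\beta$-niceness, then $\textstyle\sum_i u_i(S) \le f(S)$)} \\
\frac{\mathrm{OPT}_f(\mathcal{G})}{f(S)} &\le \alpha + \beta .
\end{align*}
```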
We now state a technical lemma that we use shortly in proving Theorem 1.

Lemma 4. Consider any non-cooperative game $\mathcal{G}$ and any function $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$. For
positive values of $\epsilon$, $a$, and $b$, any dynamic for which the increase in the value of $f$ at a step leading
from $S$ to $S'$ is at least $b - \frac{1}{a} f(S)$ converges to a state $S^F$ with $f(S^F) \ge ab(1 - \epsilon)$ in at most
$\lceil a \ln \frac{1}{\epsilon} \rceil$ steps, from any initial state.
The next theorem states that after a polynomial number of steps, for every perfect $\beta$-nice potential
game, the basic Nash dynamic reaches a state whose relative quality (with respect to $f$) is close to
the price of anarchy.

Theorem 1. Consider any function $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$ and any game $\mathcal{G}$ that is perfect $\beta$-nice
with respect to $f$. For any $\epsilon > 0$ the basic Nash dynamic converges to a state $S^F$ with
$f(S^F) \ge \frac{\mathrm{OPT}_f(\mathcal{G})}{\beta} (1 - \epsilon)$ in at most $\lceil \frac{n}{\beta} \ln \frac{1}{\epsilon} \rceil$ steps, starting from any initial state.

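Proof. Consider a generic state $S$ of the dynamic. Since $\mathcal{G}$ is $\beta$-nice, we have
$\Delta(S) \ge \mathrm{OPT}_f(\mathcal{G}) - \beta \cdot f(S)$. Let $i$ be the player moving in state $S$, and let $S'$ be the state resulting from the move of
player $i$. Since $i$ is the player with the maximum absolute improvement, we get

$$f(S') - f(S) \ge \Phi(S') - \Phi(S) \ge \Delta_i(S) \ge \frac{\Delta(S)}{n} \ge \frac{\mathrm{OPT}_f(\mathcal{G}) - \beta \cdot f(S)}{n}.$$

The theorem now follows by applying Lemma 4 with $b = \frac{\mathrm{OPT}_f(\mathcal{G})}{n}$ and $a = \frac{n}{\beta}$. □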

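A convergence result similar to Theorem 1 can be obtained for the basic $\alpha$-Nash dynamic.

Theorem 2. Consider any function $f : \Sigma \to \mathbb{R}^+ \cup \{0\}$ and any game $\mathcal{G}$ that is perfect $\beta$-nice with
respect to $f$. For any $\epsilon > 0$ and any $\alpha > 0$ the basic $\alpha$-Nash dynamic converges to a state $S^F$ with
$f(S^F) \ge \frac{\mathrm{OPT}_f(\mathcal{G})}{\beta + \alpha} (1 - \epsilon)$ in at most $\lceil \frac{n}{\beta + \alpha} \ln \frac{1}{\epsilon} \rceil$ steps, starting from any initial state.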

4 Profit-sharing games 

In this section, we study three non-cooperative games that can be constructed from an arbitrary 
monotone convex cooperative game. 

Each of our games can be described by a triple $\mathcal{G} = (N, v, M)$, where $(N, v)$ is a monotone
convex cooperative game with $N = \{1, \dots, n\}$, and $M = \{1, \dots, m\}$ is a set of $m$ parties; we
require $m \le n$. All three games considered in this section model the setting where the players
in $N$ form a coalition structure over $N$ that consists of $m$ coalitions. Thus, each player needs
to choose exactly one party from $M$, i.e., for each $i \in N$ we have $\Sigma_i = M$. In some cases
(see Section 4.2), we also allow players to be unaffiliated. To model this, we expand the set of
strategies by setting $\Sigma_i = M \cup \{0\}$. Intuitively, the parties correspond to different companies,
and the players correspond to the potential employees of these companies; we desire to assign
employees to companies so as to maximize the total productivity.

In two of our games (see Section 4.1 and Section 4.3), a state of the game is completely described
by the assignment of the players to the parties, i.e., we can write $S = (s_1, \dots, s_n)$, where
$s_i \in M$ for all $i \in N$. Alternatively, we can specify a state of the game by providing a partition of
the set $N$ into $m$ components $Q_1, \dots, Q_m$, where $Q_j$ is the set of all players that chose party $j$, i.e.,
we can write $S = (Q_1, \dots, Q_m)$; we will use both forms of notation throughout the paper. In the
game described in Section 4.2, the state of the game depends not only on which parties the players
chose, but also on the order in which they joined the party; we postpone the formal description
of this model till Section 4.2. In all three models, each player's payoff is based on the concept of
marginal utility; however, in different models this idea is instantiated in different ways.
An important parameter of a state $S = (Q_1, \dots, Q_m)$ in each of these games is its total profit
$\mathrm{tp}(S) = \sum_{j \in M} v(Q_j)$. While for the games defined in Section 4.2 and Section 4.3 the total profit
coincides with the social welfare, for the game described in Section 4.1 this is not necessarily the
case. As we are interested in finding the most efficient partition of players into teams, we consider
the total profit of a state a more relevant quantity than its social welfare. Therefore, in what follows,
we will consider the price of anarchy and the price of stability with respect to the total profit, i.e.,
we have $\mathrm{OPT}(\mathcal{G}) = \mathrm{OPT}_{\mathrm{tp}}(\mathcal{G})$, $\mathrm{PoA}(\mathcal{G}) = \mathrm{PoA}_{\mathrm{tp}}(\mathcal{G})$, $\mathrm{PoS}(\mathcal{G}) = \mathrm{PoS}_{\mathrm{tp}}(\mathcal{G})$.

All of our results generalize to the setting where each party $j \in M$ is associated with a different
non-decreasing submodular profit function $v_j : 2^N \to \mathbb{R}^+ \cup \{0\}$, i.e., different companies possess
different technologies, and therefore may have different levels of productivity. Formally, any such
game is given by a tuple $\mathcal{G} = (N, v_1, \dots, v_m, M)$, where $M = \{1, \dots, m\}$, and for each $j \in M$ the
function $v_j$ is a non-decreasing submodular function $v_j : 2^N \to \mathbb{R}^+ \cup \{0\}$ that satisfies $v_j(\emptyset) = 0$. In
this case, the total profit function in a state $S = (Q_1, \dots, Q_m)$ is given by $\mathrm{tp}(S) = \sum_{j \in M} v_j(Q_j)$.
In what follows, we present our results for this more general setting.



4.1 Fair Value games 

In our first model, the utility $u_i(S)$ of a player $i$ in a state $S = (Q_1, \dots, Q_m)$ is given by $i$'s marginal
contribution to the coalition he belongs to, i.e., if $i \in Q_j$, we set $u_i(S) = v_j(Q_j) - v_j(Q_j \setminus \{i\})$. As this
payment scheme rewards each player according to the value he creates, we will refer to this type of
games as Fair Value games. Observe that since the functions $v_j$ are assumed to be submodular, we
have $\sum_{i \in Q_j} u_i(S) \le v_j(Q_j)$ for all $j \in M$, i.e., the total payment to the employees of a company
never exceeds the profit of the company. Moreover, it may be the case that the profit of a company
is strictly greater than the amount it pays to its employees; we can think of the difference between
the two quantities as the owner's/shareholders' value. Consequently, in these games the total profit
of all parties may differ from the social welfare, as defined in Section 2.
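A small numeric example may help to see the gap between total profit and social welfare. The sketch below is illustrative: the `SKILLS` table and the `coverage` profit function are hypothetical choices made here (a coverage function is monotone and submodular), and `fair_value_payoffs` is simply a direct transcription of the payoff rule above.

```python
# Hypothetical instance: a team's profit is the number of distinct skills it covers.
SKILLS = {1: {"a", "b"}, 2: {"b", "c"}, 3: {"c"}}

def coverage(team):
    covered = set()
    for i in team:
        covered |= SKILLS[i]
    return len(covered)

def fair_value_payoffs(parties, profit_fns):
    """Pay each player his marginal contribution v_j(Q_j) - v_j(Q_j minus {i})."""
    payoffs = {}
    for team, v in zip(parties, profit_fns):
        team = frozenset(team)
        for i in team:
            payoffs[i] = v(team) - v(team - {i})
    return payoffs

parties = [{1, 2}, {3}]
profit_fns = [coverage, coverage]
payoffs = fair_value_payoffs(parties, profit_fns)
total_profit = sum(v(frozenset(team)) for team, v in zip(parties, profit_fns))
```

Here players 1 and 2 share skill "b", so each contributes only one new skill to a team worth 3; the remaining unit of profit is not paid out to any player.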

We will now argue that Fair Value games have a number of desirable properties. In particular,
any such game is a potential game, and therefore has a pure Nash equilibrium. The proof of the
following theorem can be found in Appendix B.

Theorem 3. Every Fair Value game $\mathcal{G}$ is a perfect 2-nice exact potential game w.r.t. the total profit
function.

Combining Theorem 3, Lemmas 2 and 3, and Theorems 1 and 2, we obtain the following corollaries.

Corollary 1. For every Fair Value game $\mathcal{G}$ and every $\alpha > 0$ we have $\mathrm{PoA}^{\alpha}(\mathcal{G}) \le 2 + \alpha$. In
particular, $\mathrm{PoA}(\mathcal{G}) \le 2$.

Corollary 2. For every Fair Value game $\mathcal{G}$ and any $\epsilon > 0$, the basic Nash dynamic (respectively,
the basic $\alpha$-Nash dynamic) converges to a state $S^F$ with total profit $\mathrm{tp}(S^F) \ge \frac{\mathrm{OPT}(\mathcal{G})}{2} (1 - \epsilon)$
(respectively, $\mathrm{tp}(S^F) \ge \frac{\mathrm{OPT}(\mathcal{G})}{2 + \alpha} (1 - \epsilon)$) in at most $\lceil \frac{n}{2} \ln \frac{1}{\epsilon} \rceil$ steps
(respectively, $\lceil \frac{n}{2 + \alpha} \ln \frac{1}{\epsilon} \rceil$ steps), from any initial state.
Since every Fair Value game is an exact potential game with the potential function given by the total
profit, any profit-maximizing state is necessarily a Nash equilibrium. This implies the following
proposition.

Proposition 1. For any Fair Value game $\mathcal{G}$ we have $\mathrm{PoS}(\mathcal{G}) = 1$.
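The two statements above (a profit-maximizing state is a Nash equilibrium, and every Nash equilibrium earns at least half of the optimum) can be checked by exhaustive search on a toy instance. This is a sanity-check sketch, not a proof: the four-player `SKILLS` table, the shared `coverage` profit function and all helper names are assumptions introduced here, and players and parties are indexed from 0.

```python
from itertools import product

# Hypothetical instance with 4 players and 2 parties sharing one profit function.
SKILLS = {0: {"a", "b"}, 1: {"b", "c"}, 2: {"c"}, 3: {"a", "d"}}

def coverage(team):
    covered = set()
    for i in team:
        covered |= SKILLS[i]
    return len(covered)

def teams(state, m):
    """Turn an assignment (s_1, ..., s_n) into the partition (Q_1, ..., Q_m)."""
    return [frozenset(i for i, s in enumerate(state) if s == j) for j in range(m)]

def total_profit(state, m, v):
    return sum(v(q) for q in teams(state, m))

def payoff(i, state, m, v):
    """Fair Value payoff: marginal contribution of i to his own team."""
    q = teams(state, m)[state[i]]
    return v(q) - v(q - {i})

def is_nash(state, m, v):
    for i in range(len(state)):
        for j in range(m):
            moved = state[:i] + (j,) + state[i + 1:]
            if payoff(i, moved, m, v) > payoff(i, state, m, v):
                return False
    return True

n, m = len(SKILLS), 2
states = list(product(range(m), repeat=n))
best = max(states, key=lambda s: total_profit(s, m, coverage))
opt = total_profit(best, m, coverage)
worst_ne = min(total_profit(s, m, coverage) for s in states if is_nash(s, m, coverage))
```

The enumeration visits all $m^n$ assignments, so it is only practical for very small $n$.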
4.2 Labor Union Games 

In Fair Value games, the player's payoff only depends on his current marginal value to the enter- 
prise, i.e., one's salary may go down as the company expands. However, in many real-life settings, 
this is not the case. For instance, in many industries, especially ones that are highly unionized, an 
employee that has spent many years working for the company typically receives a higher salary 
than a new hire with the same set of skills. Our second class of games, which we will refer to as 
Labor Union games, aims to model this type of settings. Specifically, in this class of games, we 
modify the notion of state so as to take into account the order in which the players have joined 
their respective parties; the payment to each player is then determined by his marginal utility for 
the coalition formed by his predecessors. The submodularity property guarantees that a player's
payoff never goes down as long as he stays with the same party. 

Formally, in a Labor Union game $\mathcal{G}$ that corresponds to a tuple $(N, v_1, \dots, v_m, M)$, we allow
the players to be unaffiliated, i.e., for each $i \in N$ we set $\Sigma_i = M \cup \{0\}$. If player $i$ plays strategy 0,
we set his payoff to be 0 irrespective of the other players' strategies. A state of $\mathcal{G}$ is given by a tuple
$\mathcal{P} = (P_1, \dots, P_m)$, where $P_j$ is the sequence of players in party $j$, ordered according to their arrival
time. As before, the profit of party $j$ is given by the function $v_j$; note that the value of $v_j$ does not
depend on the order in which the players join $j$. The payoff of each player, however, is dependent on
their position in the affiliation order. Specifically, for a player $i \in P_j$, let $P_j(i)$ be the set of players
that appear in $P_j$ before $i$. Player $i$'s payoff is then defined as $u_i(\mathcal{P}) = v_j(P_j(i) \cup \{i\}) - v_j(P_j(i))$.

We remark that, technically speaking, Labor Union games are not non-cooperative games. 
Rather, each state of a Labor Union game induces a non-cooperative game as described above; 
after any player makes a move, the induced non-cooperative game changes. Abusing terminology, 
we will say that a state $\mathcal{P}$ of a Labor Union game $\mathcal{G}$ is a Nash equilibrium if for each player $i \in N$
staying with his current party is a best response in the induced game; all other notions that were
defined for non-cooperative games in Section 2, as well as the results in Section 3, can be extended
to Labor Union games in a similar manner.

We now state two fundamental properties of our model. 

- Guaranteed payoff: Consider two players i and i' in Pj. Suppose i' moves to another party. 
The payoff of player i will not decrease. Indeed, if i' succeeds i in the sequence Pj, then by 
definition, i's payoff is unchanged. If i' precedes i in Pj, then, since Vj is non-decreasing and 
submodular, i's payoff will not decrease; it may, however, increase. 

- Full payoff distribution: The sum of the payoffs of players within a party $j$ is a telescopic
sum that evaluates to $v_j(P_j)$. Therefore, the total profit $\mathrm{tp}(\mathcal{P}) = \sum_{j \in M} v_j(P_j)$ in a state $\mathcal{P}$
equals the social welfare in this state. In other words, in Labor Union games, the profit of
each enterprise is distributed among its employees, without creating any value for the
owners/shareholders.
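Both properties can be observed on a small example. The sketch below is illustrative only: the `SKILLS` table and the `coverage` profit function are hypothetical, and each party is represented as a Python list ordered by arrival time.

```python
# Hypothetical instance: a team's profit is the number of distinct skills it covers.
SKILLS = {1: {"a", "b"}, 2: {"b", "c"}, 3: {"c"}}

def coverage(team):
    covered = set()
    for i in team:
        covered |= SKILLS[i]
    return len(covered)

def labor_union_payoffs(parties, profit_fns):
    """Pay each player his marginal contribution to the players who joined before him."""
    payoffs = {}
    for queue, v in zip(parties, profit_fns):
        before = frozenset()
        for i in queue:  # queue is ordered by arrival time
            payoffs[i] = v(before | {i}) - v(before)
            before = before | {i}
    return payoffs

# Players 1, 2, 3 joined party 0 in that order; party 1 is empty.
before_move = labor_union_payoffs([[1, 2, 3], []], [coverage, coverage])
# Player 1 then leaves for party 1; players 2 and 3 keep their relative order.
after_move = labor_union_payoffs([[2, 3], [1]], [coverage, coverage])
```

Before the move the payoffs in party 0 sum to its full profit of 3. After player 1 leaves, player 2's payoff rises from 1 to 2 and player 3's stays at 0, in line with the guaranteed payoff property.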
The guaranteed payoff property distinguishes the Labor Union games from the Fair Value games,
where a player who maintains his affiliation to a party might not be rewarded, but may rather see 
a reduction in his payoff as other players move to join his party. This, of course, may incentivize 
him to shift his affiliation as well, leading to a vicious cycle of moves. In contrast, in Labor Union 
games, a player is guaranteed that his payoff will not decrease if he maintains his affiliation to a 
party. This suggests that in Labor Union games stability may be easier to achieve. In what follows, 
we will see that this is indeed the case. 

We will first show that Labor Union games are perfect 2-nice with respect to the total profit
(or, equivalently, social welfare); this will allow us to apply the machinery developed in Section 3.
Abusing notation, let $\Delta_i(\mathcal{P})$ denote the improvement in the payoff of player $i$ if he performs a best
response move from $\mathcal{P}$, and let $\Delta(\mathcal{P}) = \sum_{i \in N} \Delta_i(\mathcal{P})$.

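Proposition 2. Any Labor Union game $\mathcal{G}$ is a perfect 2-nice game with respect to the total profit
function.

Proof. It is easy to see that $\mathcal{G}$ is a potential game with the potential function $\Phi(\mathcal{P}) = \mathrm{tp}(\mathcal{P})$.
Furthermore, for any player $i$ the increase in his payoff when he performs an improvement move
does not exceed the change in the total profit. It remains to show that $2\,\mathrm{tp}(\mathcal{P}) + \Delta(\mathcal{P}) \ge \mathrm{OPT}(\mathcal{G})$
for any $\mathcal{P} = (P_1, \dots, P_m)$. Let $(O_1, \dots, O_m)$ be a state that maximizes the total profit. We have

$$v_j(O_j) \le v_j(P_j \cup O_j) = v_j(P_j) + v_j(P_j \cup O_j) - v_j(P_j) \le v_j(P_j) + \sum_{i \in O_j \setminus P_j} \left( u_i(\mathcal{P}) + \Delta_i(\mathcal{P}) \right).$$

Summing over all parties, we obtain

$$\mathrm{OPT}(\mathcal{G}) = \sum_{j \in M} v_j(O_j) \le \sum_{j \in M} v_j(P_j) + \sum_{j \in M} \sum_{i \in O_j \setminus P_j} u_i(\mathcal{P}) + \sum_{j \in M} \sum_{i \in O_j \setminus P_j} \Delta_i(\mathcal{P}) \le 2\,\mathrm{tp}(\mathcal{P}) + \Delta(\mathcal{P}).$$

□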

As in the case of Fair Value games, Proposition 2 allows us to bound the price of anarchy of any
Labor Union game, as well as the time it takes to converge to a state with a "good" total profit.

Corollary 3. For every Labor Union game $\mathcal{G}$ and every $\alpha > 0$ we have $\mathrm{PoA}^{\alpha}(\mathcal{G}) \le 2 + \alpha$. In
particular, $\mathrm{PoA}(\mathcal{G}) \le 2$.

Corollary 4. For every Labor Union game $\mathcal{G}$ and any $\epsilon > 0$, the basic Nash dynamic (respectively,
the basic $\alpha$-Nash dynamic) converges to a state $S^F$ with total profit $\mathrm{tp}(S^F) \ge \frac{\mathrm{OPT}(\mathcal{G})}{2} (1 - \epsilon)$
(respectively, $\mathrm{tp}(S^F) \ge \frac{\mathrm{OPT}(\mathcal{G})}{2 + \alpha} (1 - \epsilon)$) in at most $\lceil \frac{n}{2} \ln \frac{1}{\epsilon} \rceil$ steps
(respectively, $\lceil \frac{n}{2 + \alpha} \ln \frac{1}{\epsilon} \rceil$ steps), from any initial state.

Let $O(\mathcal{G}) = (O_1, \dots, O_m)$ be a state that maximizes the total profit in a game $\mathcal{G}$, and let
$\mathrm{OPT}(\mathcal{G}) = \mathrm{tp}(O(\mathcal{G}))$. As in the case of Fair Value games, it is not hard to see that $O(\mathcal{G})$ is a Nash
equilibrium, i.e., $\mathrm{PoS}(\mathcal{G}) = 1$. In fact, for Labor Union games, we can prove a stronger statement.

Proposition 3. In any Labor Union game $\mathcal{G}$, $O(\mathcal{G})$ is a strong Nash equilibrium, i.e., the strong
price of stability is 1.
Proof. Consider a deviating coalition $I \subseteq N$. By the guaranteed payoff property, the deviation
does not lower the payoff of any player in $N \setminus I$ and increases the payoff of some of the deviators,
without harming the rest of the deviators. Thus, the deviation must lead to a state whose total
payoff exceeds that of $O(\mathcal{G})$, a contradiction. □

Furthermore, for Labor Union games we can show that for certain dynamics and certain initial
states one can guarantee convergence to an $\alpha$-Nash equilibrium or even a Nash equilibrium.

Proposition 4. Consider any Labor Union game $\mathcal{G} = (N, v_1, \dots, v_m, M)$ such that $v_j(I) \ge 1$ for
any $j \in M$ and any $I \in 2^N \setminus \{\emptyset\}$. For any such $\mathcal{G}$, the $\alpha$-Nash dynamic starting from any state in
which all players are affiliated with some party converges to an $\alpha$-Nash equilibrium in $O(\frac{n}{\alpha} \log W)$
steps, where $W$ is the maximum payoff that any player can achieve.

Proof. After each move in the $\alpha$-Nash dynamic, a player improves her payoff by a factor of $1 + \alpha$,
and the guaranteed payoff property ensures that payoffs of other players are unaffected. So, if a
player starts with a payoff of at least 1, she will reach a payoff of $W$ after $O(\frac{\log W}{\alpha})$ steps. Therefore,
in $O(\frac{n}{\alpha} \log W)$ steps, we are guaranteed to reach an $\alpha$-Nash equilibrium. □

Proposition 5. Suppose a Labor Union game G with n players starts at a state in which every 
player is unaffiliated. Then, in exactly n steps of the Nash dynamic, the system will reach a Nash 
equilibrium. 

Proof. The proof is by induction on the number of steps. The very first player who gets to move will 
pick the party that maximizes her payoff. Subsequently, she will never have an incentive to move, 
because no move will give her any improvement in her payoff. For the inductive step, suppose that 
k — 1 steps have elapsed, and exactly k — 1 players have moved once each and have reached their 
final destination with no incentive to move again. The player who moves at step k chooses his best 
response party. Since the profit functions are increasing and submodular, he cannot improve his 
payoff by moving to another party at a later step. Therefore, in n steps, the system reaches a Nash 
equilibrium. □ 
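The inductive argument can be simulated directly. The sketch below is a minimal illustration under several assumptions made here: the `SKILLS` table and the two profit functions (`coverage` and `capped`) are hypothetical monotone submodular examples, players move in index order, ties are broken in favour of the lowest-numbered party, and every player joins some party even when his best gain is zero. Parties are indexed from 0.

```python
# Hypothetical instance: party 0 values covered skills, party 1 values headcount up to 2.
SKILLS = {1: {"a", "b"}, 2: {"b", "c"}, 3: {"c"}, 4: {"a"}}

def coverage(team):
    covered = set()
    for i in team:
        covered |= SKILLS[i]
    return len(covered)

def capped(team):
    return min(len(team), 2)

def join_gain(queue, i, v):
    """Payoff player i would receive by joining at the end of the given queue."""
    members = frozenset(queue)
    return v(members | {i}) - v(members)

def run_from_unaffiliated(players, profit_fns):
    """Each player moves exactly once, picking a party that maximizes his payoff."""
    parties = [[] for _ in profit_fns]
    for i in players:
        j = max(range(len(parties)), key=lambda k: join_gain(parties[k], i, profit_fns[k]))
        parties[j].append(i)
    return parties

def is_nash(parties, profit_fns):
    """No player can strictly gain by leaving and joining the end of another party.
    Becoming unaffiliated pays 0 and never beats a payoff under a monotone v."""
    for j, queue in enumerate(parties):
        for pos, i in enumerate(queue):
            current = join_gain(queue[:pos], i, profit_fns[j])
            for k, other in enumerate(parties):
                if k != j and join_gain(other, i, profit_fns[k]) > current:
                    return False
    return True

profit_fns = [coverage, capped]
parties = run_from_unaffiliated([1, 2, 3, 4], profit_fns)
```

On this instance players 1 and 2 join party 0 and players 3 and 4 join party 1, and the resulting state passes the equilibrium check after exactly four moves.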

We conclude with an important open question. We have shown that for $\alpha > 0$, the $\alpha$-Nash
dynamic leads to an $\alpha$-Nash equilibrium in $O(\frac{n}{\alpha} \log W)$ steps. However, we do not know whether
there exists a dynamic that converges to a Nash equilibrium in a number of steps that is a polynomial
in $n$ and $\log W$.

4.3 Shapley games 

In our third class of games, which we call Shapley games, the players' payoffs are determined in
a way that is inspired by the definition of the Shapley value [S53]. As in Fair Value games, a
state of a Shapley game is fully described by the partition of the players into parties. Given a state
$S = (Q_1, \dots, Q_m)$ and a player $i \in Q_j$, we define player $i$'s payoff as

$$u_i(S) = \sum_{Q \subseteq Q_j \setminus \{i\}} \frac{|Q|!\,(|Q_j| - |Q| - 1)!}{|Q_j|!}\,\bigl(v_j(Q \cup \{i\}) - v_j(Q)\bigr).$$

Intuitively, the payment to each player can be viewed as his average payment in the Labor Union
model, where the average is taken over all possible orderings of the players in the party. This
immediately implies $\sum_{i \in Q_j} u_i(S) = v_j(Q_j)$. Thus, Shapley games share features with both the
Fair Value games and the Labor Union games. As in Fair Value games, the order in which the
players join the party is unimportant. Moreover, if all payoff functions are additive, i.e., we have
$v_j(S \cup \{i\}) - v_j(S) = v_j(\{i\})$ for any $j \in M$, any $i \in N$ and any $S \subseteq N \setminus \{i\}$, then the respective Shapley
game coincides with the Fair Value game that corresponds to $(N, v_1, \dots, v_m, M)$. On the other
hand, similarly to the Labor Union games, the entire profit of each party is distributed among its
members. We will first show that any Shapley game is an exact potential game and hence admits a
Nash equilibrium in pure strategies (all proofs in this section are deferred to Appendix C).
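
The two readings of the payoff, the subset formula and the average over arrival orders, can be compared on a small example. The sketch below is illustrative only: the profit function `v` (a count of distinct skills) and the names `shapley_payoff` and `average_join_payoff` are assumptions made for the example, not part of the model.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial

def shapley_payoff(i, party, v):
    """Payoff of player i in `party`, computed with the subset formula."""
    others = [p for p in party if p != i]
    size = len(party)
    total = Fraction(0)
    for r in range(len(others) + 1):
        weight = Fraction(factorial(r) * factorial(size - r - 1), factorial(size))
        for q in combinations(others, r):
            q = frozenset(q)
            total += weight * (v(q | {i}) - v(q))
    return total

def average_join_payoff(i, party, v):
    """Average over all arrival orders of i's marginal contribution on joining."""
    orders = list(permutations(party))
    total = Fraction(0)
    for order in orders:
        before = frozenset(order[:order.index(i)])
        total += v(before | {i}) - v(before)
    return total / len(orders)

# Hypothetical profit function: number of distinct skills in the coalition.
SKILLS = {1: {"a", "b"}, 2: {"b"}, 3: {"c"}}

def v(members):
    covered = set()
    for p in members:
        covered |= SKILLS[p]
    return len(covered)

party = frozenset({1, 2, 3})
payoffs = {i: shapley_payoff(i, party, v) for i in party}
```

Here the payoffs are 3/2, 1/2 and 1, they agree with the ordering average, and they sum to `v(party) = 3`, so the whole profit of the party is distributed.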

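Theorem 4. Any Shapley game $\mathcal{G} = (N, v_1, \dots, v_m, M)$ is an exact potential game with the
potential function given by

$$\Phi(S) = \sum_{j \in M} \sum_{\emptyset \ne Q \subseteq Q_j} \frac{(|Q| - 1)!\,(|Q_j| - |Q|)!}{|Q_j|!}\, v_j(Q).$$
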
Just like in other profit-sharing games, the price of anarchy in Shapley games is bounded by 2. 

Theorem 5. In any Shapley game $\mathcal{G} = (N, v_1, \dots, v_m, M)$ with $|N| = n$, we have
$\mathrm{PoA}(\mathcal{G}) \le 2 - \frac{1}{n}$.

The following claim shows that the bound given in Theorem 5 is almost tight.

Proposition 6. For any $n \ge 3$, there exists a Shapley game $\mathcal{G} = (N, v_1, v_2, M)$ with $|N| = n$ and
$|M| = 2$ such that $\mathrm{PoA}(\mathcal{G}) = 2 - \frac{2}{n+1}$ and $\mathrm{PoS}(\mathcal{G}) = 2 - \frac{4}{n+1}$.

5 Cut Games and Profit Sharing Games 

We will now describe a family of succinctly representable profit-sharing games that can be described
in terms of undirected weighted graphs. It turns out that while two well-studied classes
of games on such graphs do not induce profit-sharing games, a "hybrid" approach does. We then 
explain how to compute players' payoffs in the resulting profit-sharing games. 

In the classic cut games [SY91, FPT04, CMS06], players are the vertices of a weighted graph
$G = (N, E)$. The state of the game is a partition of players into two parties, and the payoff of
each player is the sum of the weights of cut edges that are incident on him. A cut game naturally
corresponds to a coalitional game with the set of players $N$, where the value of a coalition $S \subseteq N$
equals the weight of the cut induced by $S$ and $N \setminus S$. However, this game is not monotone, so it
does not induce a profit-sharing game, as defined in Section 4.

In induced subgraph games [DP94], the value of a coalition $S$ equals the total weight of all
edges that have both endpoints in $S$; while these games are monotone, they are not convex.

Finally, consider a game where the value of a coalition $S \subseteq N$ equals the total weight of all
edges incident on vertices in $S$, i.e., both internal edges of $S$ (as in induced subgraph games) and
the edges leaving $S$ (as in cut games). It is not hard to see that this game is both monotone and
convex, and hence induces a profit-sharing game as described in Section 4. We will now explain
how to compute players' payoffs in the corresponding Fair Value games, Labor Union games and
Shapley games, using Figure 1. In this figure, we are given a state of the game with two parties $S$
and $N \setminus S$; the players are listed from top to bottom in the order in which they (last) entered each
party. (The order is relevant only in Labor Union games.) $A$ (resp., $B$) denotes the total weight of
edges incident on $i$ that connect $i$ to a predecessor (resp., successor) within the party. $C$ is the total
weight of the cut edges incident on $i$. One can interpret an edge $e = (i, i')$ with weight $w(e)$ as a
skill or resource of value proportional to $w(e)$ that both $i$ and $i'$ possess.

Fair Value Games: The payoff of $i$ (see Figure 1) is given by $\frac{A+B}{2} + C$. Intuitively, the unique
skills of a player are weighted more toward his payoff than his shared skills.

Labor Union Games: The payoff of i is given by B + C. Intuitively, i's payoff reflects the unique 
skills that i possessed when he joined the party. Players who share skills with i, but join after i, 
will not get any payoff for those shared skills. 

Shapley Games: One can show that $i$'s payoff is given by $\frac{A+B}{2} + C$, just as in Fair Value games.

One can see that this interpretation easily extends to multiple parties and hyperedges. We also note 
that many of the notions that we have discussed are naturally meaningful in this variant of the cut 
game: for instance, an optimal state for m = 2 is a configuration in which the weighted cut size is 
maximized. 
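
The Labor Union and Shapley payoffs in this graph game can be checked by brute force on a toy graph. The sketch below is illustrative: the edge weights, the party `S` and all function names are hypothetical, and the Labor Union payoff is modelled, as an assumption, as the player's marginal contribution at the moment she joined.

```python
from fractions import Fraction
from itertools import combinations
from math import factorial

# Hypothetical weighted graph on players 1..4: EDGES[(u, w)] is the edge weight.
EDGES = {(1, 2): 4, (2, 3): 2, (1, 4): 6, (3, 4): 1}

def value(coalition):
    """Total weight of edges with at least one endpoint in `coalition`."""
    return sum(w for (a, b), w in EDGES.items() if a in coalition or b in coalition)

def abc(i, arrivals):
    """(A, B, C) for player i, where `arrivals` lists i's party in order of arrival."""
    A = B = C = 0
    for (a, b), w in EDGES.items():
        if i not in (a, b):
            continue
        other = b if a == i else a
        if other not in arrivals:
            C += w          # cut edge
        elif arrivals.index(other) < arrivals.index(i):
            A += w          # edge to a predecessor
        else:
            B += w          # edge to a successor
    return A, B, C

def labor_union_payoff(i, arrivals):
    """Marginal contribution of i at the moment she joined her party."""
    earlier = set(arrivals[:arrivals.index(i)])
    return value(earlier | {i}) - value(earlier)

def shapley_payoff(i, arrivals):
    """Shapley value of i within her party, by enumerating subsets."""
    others = [p for p in arrivals if p != i]
    size = len(arrivals)
    total = Fraction(0)
    for r in range(size):
        weight = Fraction(factorial(r) * factorial(size - r - 1), factorial(size))
        for q in combinations(others, r):
            total += weight * (value(set(q) | {i}) - value(set(q)))
    return total

S = [1, 2, 3]   # party S in order of arrival; player 4 forms N \ S
```

On this instance, every player of `S` receives $B + C$ under the Labor Union rule and $\frac{A+B}{2} + C$ under the Shapley rule; for player 2, for example, $(A, B, C) = (4, 2, 0)$.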



Fig. 1. The set $N$ of players is partitioned into parties $S$ and $N \setminus S$. Consider a player $i$. $A$ (resp., $B$)
denotes the total weight of edges incident on $i$ and connecting $i$ to a predecessor (resp., successor)
within the party. $C$ is the total weight of the cut edges incident on $i$.



6 Conclusions and Future Work 

In this paper, we studied the dynamics of coalition formation under marginal contribution-based 
profit division schemes. We have introduced three classes of non-cooperative games that can be 
constructed from any convex cooperative game. We have shown that all three profit distribution 
schemes considered in this paper have desirable properties: all three games admit a Nash equilibrium,
and even the worst Nash equilibrium is within a factor of 2 from the optimal configuration.
In addition, for Fair Value games and Labor Union games a natural dynamic process quickly converges
to a state with a fairly high total profit. Thus, when rules for sharing the payoff are fixed in
advance, we can expect a system composed of bounded-rational selfish players to quickly converge
to an acceptable set of teams.

Of course, the picture given by our results is far from complete; rather, our work should be seen 
as a first step towards understanding the behavior of myopic selfish agents in coalition formation
settings. In particular, our results seem to suggest that keeping track of the history of the game and
distributing payoffs in a way that respects players' "seniority" leads to better stability properties; it
would be interesting to see if this observation is true in practice, and whether it generalizes to other 
settings, such as congestion games. 

In contrast to the previous work on cost-sharing and profit-sharing games, our work does not 
assume that the game's payoffs are given by an underlying combinatorial structure. Rather, our 
results hold for any convex cooperative game, and, in particular, do not depend on whether it is 
compactly representable. Further, all of our results are non-computational in nature. Indeed, since 
the standard representation of cooperative games is exponential in the number of players, one 
can only hope to obtain meaningful complexity results for subclasses of cooperative games that 
possess a succinct representation; identifying such classes and proving complexity results for them 
is a promising research direction. 

In our study of Labor Union games, we took a somewhat unusual modeling approach: we 
considered a system described by a sequence of states, each of which induces a non-cooperative 
game, and proved convergence results about the dynamics of such systems. This approach can 
be extended to other classes of games such as, e.g., congestion games; indeed, there are real-life 
systems where a player's payoff depends on who selected a certain resource before him. It would 
be interesting to see if the known results for congestion games extend to this setting. 

A Proofs for Section 3



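A.1 Proof of Lemma 4

From the hypothesis we have $f(\bar S) - f(S) \ge b - \frac{1}{a} f(S)$. Let $h(S) = b - \frac{1}{a} f(S)$. Then

$$h(S) - h(\bar S) = \frac{1}{a}\bigl(f(\bar S) - f(S)\bigr) \ge \frac{1}{a}\, h(S).$$

Hence,

$$h(\bar S) \le \Bigl(1 - \frac{1}{a}\Bigr) h(S). \tag{1}$$

Consider a state $S^F$ that is reached by the dynamic starting from a state $S^1$ in $t$ steps. By recursively
applying (1), we get

$$h(S^F) \le \Bigl(1 - \frac{1}{a}\Bigr)^{t} h(S^1) \le e^{-t/a}\, h(S^1).$$

By setting $t = \lceil a \ln \frac{h(S^1)}{\epsilon b} \rceil \le \lceil a \ln \frac{1}{\epsilon} \rceil$ in the previous inequality, we derive that $h(S^F) \le \epsilon b$. Thus
we obtain $f(S^F) = ab\bigl(1 - \frac{h(S^F)}{b}\bigr) \ge ab(1 - \epsilon)$. □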
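
The recursion can be illustrated numerically. The sketch below is hedged: it assumes the hypothesis has the form $f(\bar S) - f(S) \ge b - \frac{1}{a} f(S)$ with $a \ge 1$, and it iterates the slowest progress that this hypothesis allows, starting from $f = 0$. The function names are hypothetical and floating-point arithmetic is used for simplicity.

```python
import math

def steps_needed(a, eps):
    """Number of steps t = ceil(a * ln(1 / eps)) used in the proof."""
    return math.ceil(a * math.log(1 / eps))

def worst_case_f(a, b, eps, f_start=0.0):
    """Value of f after `steps_needed` steps when each step gains exactly b - f / a."""
    f = f_start
    for _ in range(steps_needed(a, eps)):
        f += b - f / a
    return f
```

For example, with $a = 10$, $b = 2$ and $\epsilon = 0.01$, 47 steps suffice and the resulting value is at least $ab(1 - \epsilon) = 19.8$.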

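A.2 Proof of Theorem 2

Let us consider a generic state $S = (s_1, \dots, s_n)$ of the dynamic. Let $U \subseteq N$ be the subset of
players that can perform an $\alpha$-best-response move, and let $E = N \setminus U$. Note that no player $i \in E$
can improve his payoff by more than a factor of $1 + \alpha$ by deviating from his current strategy, i.e.,
$\Delta_E(S) \le \alpha \sum_{i \in E} u_i(S) \le \alpha f(S)$. By definition of a perfect $\beta$-nice game, we have

$$\Delta_E(S) + \Delta_U(S) = \Delta(S) \ge \mathrm{OPT}_f(\mathcal{G}) - \beta \cdot f(S).$$

Let $i$ be the player moving in state $S$, and let $\bar S$ be the state resulting from the move of player
$i \in U$. Since $i$ is the player with the maximum absolute improvement among the players in $U$, we
get

\begin{align*}
f(\bar S) - f(S) &\ge \Phi(\bar S) - \Phi(S) \\
&\ge \frac{\Delta_U(S)}{|U|} \\
&\ge \frac{\Delta_U(S)}{n} \\
&\ge \frac{\mathrm{OPT}_f(\mathcal{G}) - \beta \cdot f(S) - \Delta_E(S)}{n} \\
&\ge \frac{\mathrm{OPT}_f(\mathcal{G}) - \beta \cdot f(S) - \alpha \cdot f(S)}{n} \\
&= \frac{\mathrm{OPT}_f(\mathcal{G})}{n} - \frac{\beta + \alpha}{n}\, f(S).
\end{align*}

The theorem now follows by applying Lemma 4 with $b = \frac{\mathrm{OPT}_f(\mathcal{G})}{n}$ and $a = \frac{n}{\alpha + \beta}$. □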

B Proof of Theorem 1



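It is easy to see that $\mathcal{G}$ is an exact potential game, where the potential function is given by the total
profit. In order to prove the theorem, we need to show that for each state $S$ we have
$2 \cdot \mathrm{tp}(S) + \Delta(S) \ge \mathrm{OPT}(\mathcal{G})$. Consider any state $S = (s_1, s_2, \dots, s_n)$, and let $S' = (s'_1, s'_2, \dots, s'_n)$ be the
state of best responses to $S$, that is, let $s'_i$ be the best response of player $i$ in state $S$. Moreover, let
$S^* = (s^*_1, \dots, s^*_n)$ be a state that maximizes the total profit. Consider a party $k \in M$, and let
$Q_k = \{i \in N \mid s_i = k\}$, $Q^*_k = \{i \in N \mid s^*_i = k\}$. We obtain

\begin{align}
\Delta_{Q^*_k}(S) &= \sum_{j \in Q^*_k} \bigl(u_j(S_{-j}, s'_j) - u_j(S)\bigr) \notag \\
&\ge \sum_{j \in Q^*_k} \bigl(u_j(S_{-j}, k) - u_j(S)\bigr) \tag{2} \\
&= \sum_{j \in Q^*_k} u_j(S_{-j}, k) - \sum_{j \in Q^*_k} u_j(S) \notag \\
&= \sum_{j \in Q^*_k \setminus Q_k} \bigl(v_k(Q_k \cup \{j\}) - v_k(Q_k)\bigr) + \sum_{j \in Q^*_k \cap Q_k} u_j(S) - \sum_{j \in Q^*_k} u_j(S) \notag \\
&\ge v_k(Q_k \cup (Q^*_k \setminus Q_k)) - v_k(Q_k) - \sum_{j \in Q^*_k} u_j(S) \tag{3} \\
&\ge v_k(Q^*_k) - v_k(Q_k) - \sum_{j \in Q^*_k} u_j(S), \tag{4}
\end{align}

where (2) holds because for each player $j$ the improvement from selecting the best response $s'_j$
is at least the improvement achieved by choosing the optimal strategy $s^*_j = k$, (3) follows from
Lemma 1, whereas (4) holds because $v_k$ is non-decreasing.
By summing these inequalities over all parties $k$, we obtain

\begin{align}
\Delta(S) = \sum_{k \in M} \Delta_{Q^*_k}(S) &\ge \sum_{k \in M} v_k(Q^*_k) - \sum_{k \in M} v_k(Q_k) - \sum_{k \in M} \sum_{j \in Q^*_k} u_j(S) \notag \\
&= \mathrm{tp}(S^*) - \mathrm{tp}(S) - \sum_{j \in N} u_j(S) \notag \\
&\ge \mathrm{OPT}(\mathcal{G}) - 2\,\mathrm{tp}(S), \tag{5}
\end{align}

where (5) follows from the fact that for every state $S$ we have $\sum_{j \in N} u_j(S) \le \mathrm{tp}(S)$. □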



C Proofs for Section 4.3

C.1 Proof of Theorem 4

Suppose that in some state $S = (Q_1, \dots, Q_m)$ of the game a player $i$ that belongs to party 1
wants to switch to party 2. Let $S'$ be the state after player $i$ switches. Our goal is to show that
$u_i(S') - u_i(S) = \Phi(S') - \Phi(S)$, so $\Phi$ is indeed a potential function of the game.
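
We can compute the utility of player $i$ in both states, taking into account that in state $S$ player $i$
belongs to party 1 with $|Q_1|$ members, but in state $S'$ she belongs to party 2 with $|Q_2| + 1$ members:

\begin{align*}
u_i(S') &= \sum_{Q \subseteq Q_2} \frac{|Q|!\,(|Q_2| - |Q|)!}{(|Q_2| + 1)!}\,\bigl(v_2(Q \cup \{i\}) - v_2(Q)\bigr), \\
u_i(S) &= \sum_{Q \subseteq Q_1 \setminus \{i\}} \frac{|Q|!\,(|Q_1| - |Q| - 1)!}{|Q_1|!}\,\bigl(v_1(Q \cup \{i\}) - v_1(Q)\bigr).
\end{align*}

The only parties whose composition changes as we move from state $S$ to state $S'$ are party 1 and
party 2. Therefore, when computing the difference between $\Phi(S')$ and $\Phi(S)$, we can ignore all
other parties. Writing $Q_1$ and $Q_2$ for the two parties in state $S$ (so that $i \in Q_1$ and $i \notin Q_2$), we have

\begin{align*}
\Phi(S') - \Phi(S)
&= \sum_{\emptyset \ne Q \subseteq Q_1 \setminus \{i\}} \frac{(|Q| - 1)!\,(|Q_1| - 1 - |Q|)!}{(|Q_1| - 1)!}\, v_1(Q)
 + \sum_{\emptyset \ne Q \subseteq Q_2 \cup \{i\}} \frac{(|Q| - 1)!\,(|Q_2| + 1 - |Q|)!}{(|Q_2| + 1)!}\, v_2(Q) \\
&\quad - \sum_{\emptyset \ne Q \subseteq Q_1} \frac{(|Q| - 1)!\,(|Q_1| - |Q|)!}{|Q_1|!}\, v_1(Q)
 - \sum_{\emptyset \ne Q \subseteq Q_2} \frac{(|Q| - 1)!\,(|Q_2| - |Q|)!}{|Q_2|!}\, v_2(Q).
\end{align*}

Splitting each sum over subsets of $Q_1$ and of $Q_2 \cup \{i\}$ according to whether the subset contains $i$,
combining the coefficients of each $v_1(Q)$ and $v_2(Q)$, and using $v_1(\emptyset) = v_2(\emptyset) = 0$, we obtain

\begin{align*}
\Phi(S') - \Phi(S)
&= \sum_{Q \subseteq Q_2} \frac{|Q|!\,(|Q_2| - |Q|)!}{(|Q_2| + 1)!}\,\bigl(v_2(Q \cup \{i\}) - v_2(Q)\bigr)
 - \sum_{Q \subseteq Q_1 \setminus \{i\}} \frac{|Q|!\,(|Q_1| - |Q| - 1)!}{|Q_1|!}\,\bigl(v_1(Q \cup \{i\}) - v_1(Q)\bigr) \\
&= u_i(S') - u_i(S).
\end{align*}

□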
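
The exact-potential identity can also be checked exhaustively on a small instance. The sketch below is hedged: it assumes the potential has the form $\Phi(S) = \sum_{j \in M} \sum_{\emptyset \ne Q \subseteq Q_j} \frac{(|Q|-1)!\,(|Q_j|-|Q|)!}{|Q_j|!} v_j(Q)$, which is the form suggested by the coefficients in the derivation, and it uses two hypothetical profit functions `v1` and `v2` with value 0 on the empty set. All names are illustrative.

```python
from fractions import Fraction
from itertools import combinations, product
from math import factorial

def shapley_payoff(i, party, v):
    """Payoff of player i in `party` (a frozenset containing i)."""
    others = sorted(party - {i})
    size = len(party)
    total = Fraction(0)
    for r in range(len(others) + 1):
        weight = Fraction(factorial(r) * factorial(size - r - 1), factorial(size))
        for q in combinations(others, r):
            q = frozenset(q)
            total += weight * (v(q | {i}) - v(q))
    return total

def potential(parties, values):
    """Assumed potential: weighted sum of v_j(Q) over nonempty Q inside each party."""
    total = Fraction(0)
    for party, v in zip(parties, values):
        size = len(party)
        for r in range(1, size + 1):
            weight = Fraction(factorial(r - 1) * factorial(size - r), factorial(size))
            for q in combinations(sorted(party), r):
                total += weight * v(frozenset(q))
    return total

def split(state, m):
    """state[p - 1] is the party index of player p."""
    return [frozenset(p for p, s in enumerate(state, 1) if s == j) for j in range(m)]

def potential_tracks_deviations(n, values):
    """Check Phi(S') - Phi(S) == u_i(S') - u_i(S) for every unilateral move."""
    m = len(values)
    for state in product(range(m), repeat=n):
        before = split(state, m)
        for i in range(1, n + 1):
            old = state[i - 1]
            for new in range(m):
                if new == old:
                    continue
                moved = state[:i - 1] + (new,) + state[i:]
                after = split(moved, m)
                gain = (shapley_payoff(i, after[new], values[new])
                        - shapley_payoff(i, before[old], values[old]))
                if potential(after, values) - potential(before, values) != gain:
                    return False
    return True

# Hypothetical profit functions, both equal to 0 on the empty set.
def v1(q):
    return 3 * min(len(q), 2)

def v2(q):
    return max(q, default=0)
```

Calling `potential_tracks_deviations(4, [v1, v2])` enumerates all 16 states and every single-player switch, using exact rational arithmetic so that equality is tested without rounding error.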

C.2 Proof of Theorem 5

Let $S = (Q_1, \dots, Q_m)$ be a Nash equilibrium state, and let $S^* = (Q^*_1, \dots, Q^*_m)$ be a state where
the maximum total profit is achieved. It suffices to show that $(1 - \frac{1}{n})\,\mathrm{tp}(S) + \mathrm{tp}(S) \ge \mathrm{tp}(S^*)$.

Observe first that if $|Q_j| = n$ for some $j \in M$, then $S$ is an optimal state. Indeed, if $S$ is not
optimal, by the total payoff distribution property there exists a party $k \in M$ and a player $i \in Q^*_k$
such that $u_i(S^*) > u_i(S)$. If player $i$ switches to party $k$, which currently has no members, by
the submodularity property his payoff will be at least $u_i(S^*)$, a contradiction with $S$ being a Nash
equilibrium state. Therefore, from now on, we assume that $|Q_j| < n$ for all $j \in M$.

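Now, we have

$$\mathrm{tp}(S) = \sum_{i \in N} u_i(S) = \sum_{j \in M} \sum_{i \in Q^*_j} u_i(S).$$

For any $j \in M$ and all $i \in Q^*_j$, we can derive a lower bound on $u_i(S)$. There are two cases to be
considered.

(1) If $i \in Q_j$, we have

\begin{align*}
u_i(S) &= \sum_{Q \subseteq Q_j \setminus \{i\}} \frac{|Q|!\,(|Q_j| - |Q| - 1)!}{|Q_j|!}\,\bigl(v_j(Q \cup \{i\}) - v_j(Q)\bigr) \\
&\ge \sum_{Q \subseteq Q_j \setminus \{i\}} \frac{|Q|!\,(|Q_j| - |Q|)!}{(|Q_j| + 1)!}\,\bigl(v_j(Q \cup \{i\}) - v_j(Q)\bigr).
\end{align*}

(2) If $i \notin Q_j$, we have

$$u_i(S) \ge \sum_{Q \subseteq Q_j} \frac{|Q|!\,(|Q_j| - |Q|)!}{(|Q_j| + 1)!}\,\bigl(v_j(Q \cup \{i\}) - v_j(Q)\bigr),$$

since $S$ is a Nash equilibrium, and hence player $i$ cannot increase his utility by switching to
party $j$.

Changing the order of summation, by Lemma 1 we have

$$\mathrm{tp}(S) \ge \sum_{j \in M} \sum_{Q \subseteq Q_j} \frac{|Q|!\,(|Q_j| - |Q|)!}{(|Q_j| + 1)!}\,\bigl(v_j(Q \cup Q^*_j) - v_j(Q)\bigr).$$

Set $q = |Q_j|$. We have

$$\sum_{Q \subseteq Q_j} \frac{|Q|!\,(|Q_j| - |Q|)!}{(|Q_j| + 1)!} = \sum_{s=0}^{q} \binom{q}{s} \frac{s!\,(q - s)!}{(q + 1)!} = \sum_{s=0}^{q} \frac{1}{q + 1} = 1;$$

this identity can also be derived by considering Shapley values in an additive game with $|Q_j| + 1$
players. Further, we have $v_j(Q \cup Q^*_j) \ge v_j(Q^*_j)$. Thus,

$$\mathrm{tp}(S) \ge \sum_{j \in M} v_j(Q^*_j) - \sum_{j \in M} \sum_{Q \subseteq Q_j} \frac{|Q|!\,(|Q_j| - |Q|)!}{(|Q_j| + 1)!}\, v_j(Q). \tag{6}$$

For any $Q \subseteq Q_j$, we have $v_j(Q) \le v_j(Q_j)$, and, moreover, $v_j(\emptyset) = 0$. Recall also that we assume
that $|Q_j| < n$ for all $j \in M$. Thus we can bound the negative term in the right-hand side of (6) as

$$\sum_{j \in M} \sum_{Q \subseteq Q_j} \frac{|Q|!\,(|Q_j| - |Q|)!}{(|Q_j| + 1)!}\, v_j(Q) \le \sum_{j \in M} \Bigl(1 - \frac{1}{|Q_j| + 1}\Bigr) v_j(Q_j) \le \Bigl(1 - \frac{1}{n}\Bigr) \mathrm{tp}(S). \tag{7}$$

Combining (6) and (7), and noting that $\sum_{j \in M} v_j(Q^*_j) = \mathrm{tp}(S^*)$, we obtain $(2 - 1/n)\,\mathrm{tp}(S) \ge \mathrm{tp}(S^*)$. □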
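
The combinatorial identity used in this proof, namely that the weights $\frac{|Q|!\,(q - |Q|)!}{(q + 1)!}$ over all subsets $Q$ of a $q$-element party sum to 1, is easy to confirm with exact arithmetic. The sketch below is a minimal check; the function names `weight_sum` and `empty_set_weight` are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def weight_sum(q):
    """Sum over all subsets Q of a q-element set of |Q|! (q - |Q|)! / (q + 1)!."""
    return sum(
        Fraction(comb(q, s) * factorial(s) * factorial(q - s), factorial(q + 1))
        for s in range(q + 1)
    )

def empty_set_weight(q):
    """Weight of Q = emptyset, i.e. 0! q! / (q + 1)! = 1 / (q + 1)."""
    return Fraction(factorial(q), factorial(q + 1))
```

Since the empty set carries weight $\frac{1}{q+1}$ and contributes nothing (its value is 0), the remaining weights sum to $1 - \frac{1}{q+1}$, which is at most $1 - \frac{1}{n}$ whenever $q < n$.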

C.3 Proof of Proposition 6

Proof. Let $v_1$ be an additive function given by $v_1(\{1\}) = \frac{1}{n}$, $v_1(\{i\}) = \frac{1}{n-1}$ for $i \ge 2$, and let
$v_2(Q) = 1$ for any $Q \ne \emptyset$.

The state $S^* = (Q^*_1, Q^*_2)$ with $Q^*_1 = \{2, \dots, n\}$, $Q^*_2 = \{1\}$ has total profit $(n - 1)\frac{1}{n-1} + 1 = 2$,
which is the optimum in this game.

On the other hand, a state $S = (Q_1, Q_2)$ with $Q_1 = \{1\}$, $Q_2 = \{2, \dots, n\}$ is a Nash equilibrium.
Indeed, player 1 is paid $\frac{1}{n}$ and will be paid the same amount if he switches parties, so he
has no incentive to switch. All other players are paid $\frac{1}{n-1}$, and any of them will be paid the same
amount if he switches to the first party. Therefore none of them has an incentive to switch either.
The total profit in state $S$ is $1 + \frac{1}{n}$. There is no Nash equilibrium with a smaller total profit, because
in any Nash equilibrium state there are players in both parties, and hence the total profit is at least
$1 + \frac{1}{n}$. Thus, $\mathrm{PoA}(\mathcal{G}) = \frac{2}{1 + 1/n} = 2 - \frac{2}{n+1}$.

In any Nash equilibrium, party 2 contains at least $n - 2$ players. Hence, the total profit in any
Nash equilibrium is at most $\frac{2}{n-1} + 1$. This profit is achieved in, e.g., state $S' = (Q'_1, Q'_2)$ with
$Q'_1 = \{n - 1, n\}$, $Q'_2 = \{1, \dots, n - 2\}$. Therefore, $\mathrm{PoS}(\mathcal{G}) = \frac{2}{1 + 2/(n-1)} = 2 - \frac{4}{n+1}$. □
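
For small $n$ the construction can be verified by enumerating all states. The sketch below is hedged: it assumes the game is $v_1(\{1\}) = \frac{1}{n}$, $v_1(\{i\}) = \frac{1}{n-1}$ for $i \ge 2$ (extended additively) and $v_2(Q) = 1$ for nonempty $Q$, that every player belongs to one of the two parties, and that the price of anarchy (stability) is the optimum divided by the worst (best) Nash equilibrium profit. All function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations, product
from math import factorial

def shapley_payoff(i, party, v):
    """Payoff of player i in `party` (a frozenset containing i)."""
    others = sorted(party - {i})
    size = len(party)
    total = Fraction(0)
    for r in range(len(others) + 1):
        weight = Fraction(factorial(r) * factorial(size - r - 1), factorial(size))
        for q in combinations(others, r):
            q = frozenset(q)
            total += weight * (v(q | {i}) - v(q))
    return total

def make_values(n):
    """Assumed profit functions of the two parties."""
    def v1(q):
        return sum((Fraction(1, n) if p == 1 else Fraction(1, n - 1) for p in q),
                   Fraction(0))
    def v2(q):
        return Fraction(1) if q else Fraction(0)
    return [v1, v2]

def split(state):
    """state[p - 1] is the party (0 or 1) of player p."""
    return [frozenset(p for p, s in enumerate(state, 1) if s == j) for j in (0, 1)]

def payoff(i, state, values):
    j = state[i - 1]
    return shapley_payoff(i, split(state)[j], values[j])

def total_profit(state, values):
    return sum((v(q) for v, q in zip(values, split(state))), Fraction(0))

def is_nash(state, values):
    """No player strictly gains by switching to the other party."""
    for i in range(1, len(state) + 1):
        flipped = state[:i - 1] + (1 - state[i - 1],) + state[i:]
        if payoff(i, flipped, values) > payoff(i, state, values):
            return False
    return True

def poa_and_pos(n):
    values = make_values(n)
    states = list(product((0, 1), repeat=n))
    opt = max(total_profit(s, values) for s in states)
    nash = [total_profit(s, values) for s in states if is_nash(s, values)]
    return opt / min(nash), opt / max(nash)
```

Under these assumptions, `poa_and_pos(n)` agrees with $2 - \frac{2}{n+1}$ and $2 - \frac{4}{n+1}$ for $n = 3, 4, 5$; the enumeration grows as $2^n$, so it is only meant as a sanity check for small instances.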